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SYSTEMS AND METHODS FOR SEPARATING 
MULTIPLE SOURCES USING DIRECTIONAL FILTERING 

Background of the Invention 

[0001] The present invention relates to systems and 

5 methods for processing multiple sources, and more 
particularly to separating the sources using 
directional filtering . 

[0002] There may be instances in which there are 

several sources emitting signals. The combination of 

10 these sources typically forms a composite signal (e.g., 
a signal representing a mixture of these sources) that 
may be received by a sensor. While there are many 
applications for the received composite signal, such as 
amplification, it is sometimes desirable to selectively 

15 isolate or separate sources in the composite signal. 
This problem of separating sources is sometimes 
referred to as the "cocktail party problem" or "blind 
source separation. " 

[0003] For example, in an acoustic environment, 

2 0 hearing aids may be used to amplify sounds for the 
benefit of the user. However, because hearing aids 
receive all sound impinging on its receiver, it 
amplifies desired sounds (e.g., conversation) and 
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undesired sounds (e.g., background noise). Such 
amplification of all received sounds may make it more 
difficult for the user to hear. Therefore, hearing 
aids have been designed to filter out background noise 
5 (e.g., undesired sources) while allowing speech and 

other sounds (e.g., desired sources) to pass through to 
the user. One way to accomplish this is to separate 
the sources of sound being received by the hearing aid, 
reconstruct the desired sources, and transmit the 
10 reconstructed sources to the user. 

[0004] As another example, source separation may be 
used to separate radio signals being emitted by 
different transmitters . 

[0005] Several approaches have been undertaken to 

15 separate sources through the use of machines, 

mathematical models, algorithms, and combinations 
thereof, but these approaches have achieved limited 
success or are bound by restrictive operating 
conditions. Some approaches require use of multiple 

20 sensors (e.g., microphones) in order to separate 
sources. Such an approach relies on the relative 
attenuation and delay from each source as received by 
the multiple sensors. Use of multiple sensors is 
described, for example, in U.S. Patent Nos . 6,526,148 

25 and 6,317,703. Although these multiple sensor 

techniques may be used to separate sources, they fail 
when used in connection with a single sensor. 
[0006] Single sensor source separation techniques 

have been attempted, such as those described in the 

3 0 Journal of Machine Learning Research (hereinafter 

11 JMLR" ) , Vol. 4, 2003, and in particular, pages 1365- 
1392, and in Advances in Neural Information Processing 
Systems (hereinafter "ANIPS"), Vol. 13, 2001, and in 
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particular, pages 793-799, but these techniques require 
detailed knowledge of the sources and fail to use 
directional filtering as a cue in performing source 
separation. 

5 [0007] While existing machine/algorithm combinations 
strive to achieve source separation, organisms on the 
other hand, such as mammals, have an innate ability to 
distinguish among many different sources, even when 
placed in a noisy environment. The auditory processing 

10 functions of an organism's brain separate and identify 
which sounds belong to which sources. For example, a 
person placed in a noisy environment may hear many 
different types of sounds, yet still be able to 
identify the source (e.g., the radio, the person 

15 talking, etc.) of each of these sounds. 

[0008] Organisms accomplish source separation by 

localizing sound sources using a variety of binaural 
and monaural cues. Binaural cues can include intra - 
aural intensity and phase disparity. Monaural cues can 

20 include directional filtering. Directional filtering 
is typically performed by the organism's ears. That 
is, the ears "directionalize" sounds based on the 
location from which the sounds originate. For example, 
a "bop" sound originating from the front of a person 

25 sounds different from the same "bop" sound originating 
from the right side of a person. This is sometimes 
referred to as the "head and pinnae" relationship, 
where the head is the sensor and the pinnae is the 
location of the source. These differences in sound, 

3 0 depending on the location in which the sound source is 
located, are used as spatial cues by the organism's 
auditory system to separate the sources. In other 
words, the ears directionalize each source based on its 
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location and transmit the directionalized (e.g., 
filtered) sound information to the brain for use in 
source separat ion . 

[0009] Therefore, it is an object of the invention 

5 to provide systems and methods that overcome the 

deficiencies of the aforementioned source separation 
techniques and that utilize directional filtering to 
accurately and quickly separate sources. 
[0010] It is another object of the invention to 

10 separate sources using just one sensor. 

Summary of the Invention 

[0011] These and other objects of the invention are 

accomplished by providing systems and methods that use 
directional filters to perform source separation. The 

15 composite signal received by the sensor can be 

characterized mathematically to represent the sum of 
the filtered sources. Each source can be represented 
mathematically as the weighted sum of basis waveforms, 
with the weights (coefficients) being sufficient to 

20 characterize the source. The basis waveforms can be 

filtered, so the same coefficients represent the source 
before and after the transformation between the 
transmitter and the sensor, using a different set of 
basis waveforms. The transformation itself, is based 

25 on, for example, the location of the source, the 

environment (e.g., a small room as opposed to a large 
room) , reverberations, signal distortion, and other 
factors . 

[0012] The directional filters are used to 

30 approximate these transformations. More particularly, 
directional filters may be used to generate signal 
dictionaries that include a set of filtered basis 



- 5 - 



signals. Thus, when the composite signal is received, 
source separation is performed using the composite 
signal and the signal dictionary to estimate the value 
of the coefficients. The estimated value of the 
5 coefficients is used to selectively reconstruct one or 
more sources contributing to the composite signal. 
[0013] Two different "types" of reconstructed 

sources can be obtained in accordance with the 
invention. One type refers to source reconstruction of 

10 sources received by the sensor. Hence, this "sensor 
type" reconstruction reconstructs sources that have 
undergone transformation. Another type refers to 
source reconstruction of sources being emitted 
substantially directly from the source itself. This 

15 "source type" reconstruction reconstructs sources that 
have not undergone a transformation. Source type 
reconstructed sources are "de-echoed." 

[0014] An advantage of the invention is that source 

separation can be performed with the use of just one 

20 sensor. The elimination of the need to use multiple 

sensors is beneficial, especially when considering the 
miniaturization trend seen in conventional electronic 
applications. However, if desired, source separation 
can also be performed using multiple sensors. 

25 [0015] Further features of the invention, its nature 

and various advantages will be more apparent from the 
accompanying drawings and the following detailed 
description of the preferred embodiments. 

Brief Description of the Drawings 



30 



[0016] FIG. 1 shows a block diagram that illustrates 

transformation of a source in accordance with the . 
principles of the invention. 
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[0017] FIG. 2 shows a block diagram of multiple 

sources that are each located in a particular location 
and being received by a sensor in accordance with the 
principles of the invention. 
5 [0018] FIG. 3 shows a flowchart for generating a 

signal dictionary in accordance with the invention. 

[0019] FIG. 4 shows a flowchart for separating 

sources in accordance with the invention. 

[0020] FIG. 5 shows two illustrative graphs 

10 depicting the results of source separation, with one 
graph showing results without using directional 
filtering and the other showing results using 
directional filtering in accordance with the invention. 

[0021] FIG. 6 shows an illustrative system for 

15 performing source separation in accordance with the 
invention. 

Detailed Description 

[0022] In accordance with the present invention, 

systems and methods are provided to separate multiple 

20 sources using cues derived from filtering imposed by 
the head and pinnae on sources located at different 
positions in space. The present invention operates on 
the assumption that each source occupies a particular 
location in space, and that because each source 

25 occupies a particular location, each source exhibits 
properties or characteristics indicative of its 
position. These properties are used as cues in 
enabling the invention to separate sources. 
[0023] Referring to FIG. 1, source 110 emits a 

30 signal, represented here as x(t) . Sensor 130 typically 
does not receive x(t) exactly as it is emitted by 
source 110, but receives a filtered version of x(t), 
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x ! (t) . That is, x(t) typically undergoes a 
transformation, as indicated by filter 120, as it 
travels from the source to the sensor, resulting in 
x 1 (t) . Several factors may contribute to the 
5 transformation or filtering of x(t) . For example, the 
environment, reverberations, distortion, echoes, 
delays, frequency- dependent attenuation, and the 
location of the source may be factors accounting for 
the transformation of the source x(t) . 

10 [0024] The present invention approximates the 

transformation process of signals through the 
application of directional filters such as head-related 
transfer functions ( "HRTFs " ) . In general, directional 
filters modify a source x(t) according to its position 

15 to generate a filtered source x' (t) . An advantage of 
directional filters is that they can be used to 
incorporate factors, as mentioned above, that affect a 
source x(t). Using these directional filters, the 
present invention generates signal dictionaries that 

20 hypothesize how each source x(t) will be received by a 
sensor after that source has undergone a 

transformation. The invention is then able to separate 
the sources utilizing the signal dictionary and a 
composite signal received by the sensor. 

25 [0025] FIG. 1 also shows two different domains, 

"source space" and "sensor space," that will be 
referred to herein. Source space is source-oriented 
and refers to sources that have not been subject to 
filtering, indicating that the signals emitted by 

3 0 sources have not undergone a transformation. Sensor 
space is sensor-oriented and refers to sources that 
have undergone transformation and are received by the 
sensor. One advantage of the invention is that it can 
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reconstruct sources in sensor space, source space, or 
both . 

[0026] FIG. 2 shows an illustration of multiple 

sources xi-x 5 disposed in distinct locations about 
5 sensor 210. This illustrates an assumption of the 

invention that each source occupies a distinct position 
in space, and has a corresponding directional filter, 
shown as hi-h 5 . Sources Xi-x 5 may simultaneously emit 
signals that are being received by sensor 210. The 
10 combination or mixture of the signals being emitted by 
sources Xi-x 5 may form a composite signal, which is 
received by sensor 210. 

[0027] ^ The composite signal y(t) received by 
sensor 210 can be defined by the sum of filtered 
15 sources: 

where * indicates convolution, h±(t) represents a 
directional filter of the ith source, and x± ( t) 
represents the ith source. Note that (t) indicates 

20 that the signals are time-varying signals. Persons 
skilled in the art will appreciate that the 
relationship defined in equation 1 is not absolute, but 
merely illustrative. Moreover, even though equation 1 
represents the time-domain, persons skilled in the art 

2 5 will appreciate that source separation can be performed 
in a transform domain such as the frequency domain. 
[0028] Equation 1 illustrates a general framework 

from which the sources are separated. Sources Xi (t) can 
be reconstructed from the composite signal y(t) 

30 received by sensor 210 using the knowledge of the 

directional filters h ± (t) . To illustrate this point, 
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FIG. 2 shows that each source Xi-x 5 undergoes 
transformation by its respective filter hi-h 5 . The 
resulting filtered sources x 1 -x 5 are received by 
sensor 210 as a composite signal y(t). Thus, the 
5 composite signal y(t), which is the summation of the 
filtered sources, is known and is used as a known 
variable in source separation. Because each source 
exhibits certain properties based on its location, 
these properties can be approximated by directional 
10 filters hi-h 5 . The directional filters provide another 
known variable that can be used in source separation. 
Thus, the sources can be separated using the composite 
signal and knowledge obtained from the directional 
filters . 

15 [0029] An advantage of the invention is that it can 

separate many types of signals. For example, the 
signals can include, but are not limited to, acoustic 
signals, radiowaves, light signals, nerve pulses, 
electromagnetic signals, ultrasound waves, and other 

20 types of signals. For the purposes of clarity and 

simplicity, the various embodiments described herein 
refer to acoustic or sound sources. 

[0030] A source x± (t) can be represented as the 
weighted sum of many basis signals 

2 5 *,(<) = {2) 

J 

where the weighting of a particular basis signal's 
(i.e., dj(t)) contribution to source i is c±j . The 

coefficient c±j typically represents the amplitude 
(e.g., volume) of the source. The signal dj(t) 
30 represents a "pure" or unfiltered signal (i.e., a 

representation of a signal as it is emitted 

substantially directly by the source) . Note the 
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relationship shown in equation 2 is merely illustrative 
of one way to define a source and that it is understood 
that there are potentially endless variations in 
defining sources. 
5 [0031] Because it is known that the composite signal 

is the sum of the filtered sources, equation 2 can be 
rewritten as 

u u 
where d'^t) = h ± (t)*dj (t) is introduced to represent 

10 filtered copies of dj(t). The filtered signal d' ±j (t) 
represents a hypothesis of how a signal sounds if it 
originates from a particular location. Thus, the 
directional filter modifies the properties of the 
signal to take on the properties of a signal 

15 originating from a particular location. 

[0032] Equation 3 illustrates a more specific 

framework from which the invention can separate 
sources. Equation 3 shows three variables, y(t) , c±j, 
and d^t). Two of these three variables are known: 

20 y(t) , which is the composite signal received by the 
sensor, and d ±i (t', § which is an entry in a signal 
dictionary. (Signal dictionaries are discussed below) . 
Because there is only one unknown in an equation of 
three variables, the unknown variable, Cij, can be 

25 solved. The invention can use mathematical techniques 
to solve for the unknown variables. For example, the 
unknown coefficients can be solved using linear 
algebra. When the coefficients are solved, the 
invention can reconstruct one or more desired sources 

30 forming the composite signal. 
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[0033] In general, signal dictionaries include many 

different signals. The present invention may use two 
different signal dictionaries: a pre-filter signal 
dictionary and a post-filter signal dictionary. 
5 Construction of the signal dictionaries is variable. 
For example, they may be generated as part of a pre- 
processing step (e.g., prior to source separation) or 
they may be generated, updated, or modified while 
performing source separation. Furthermore, the signal 

10 dictionaries may be subject to several predefined 

criteria while being constructed (discussed below) . 
[0034] FIG. 3 shows steps for generating a post- 

filter signal dictionary that enables the invention to 
separate sources in accordance with the principles of 

15 the present invention. Step 310 shows that a pre- 
filter signal dictionary is provided. A pre-filter 
signal dictionary includes a predetermined number of 
basis functions, d(t), as shown in box 315. Each basis 
function represents a brief waveform of which a 

2 0 reasonably small number can be combined to form a 

signal of interest. Moreover, each basis function may 
represent a brief waveform as it is emitted 
substantially directly from a source, irrespective of 
the source's location. Thus, a basis function forms 

25 part of a source. For example, the dij(t) in equation 2 
may be duplicated in the pre-filter signal dictionary. 
[0035] The basis functions may be chosen based on 

two criteria. First, sources are preferably sparse 
when represented in the pre-filter signal dictionary. 

30 In other words, in a sparse representation, the 

coefficients Cij used to represent a particular source 
Xi(t) have a distribution including mostly zeros and 
"large" values. An example of such a distribution of 
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coefficients can be governed by a Laplacian 
distribution. A Laplacian distribution, as compared to 
a Gaussian distribution, has a "fatter tail" and 
therefore corresponds to a sparser description. 
5 [0036] Second, basis functions dj(t) may be chosen 

such that, following transformation by a filter (e.g., 
a HRTF filter) , the resulting filtered copies of a 
particular basis function differ as much as possible. 
This improves the accuracy of the estimated 

10 coefficients. 

[0037] It is noted that methods and techniques for 

constructing pre- filter signal dictionaries are known 
by those with skill in art and need not be discussed 
with more particularity. See, for example, Neural 

15 Computation (Vol. 13, No. 4, 2000 and in particular 

pp. 863-882) for a more detailed discussion of signal 
dictionaries . 

[0038] At step 320, the directional filters are 

provided. Directional filters may modify the basis 

20 functions of the pre-filter signal dictionary so that 
the modified basis functions take on properties 
indicative of such basis functions being emitted by a 
source positioned at a particular location. The number 
of directional filters provided and the complexity of 

25 directional filters may vary depending on any number of 
factors, including, but not limited to the type of 
signals emitted by the sources, the number of sensors - 
used, and pre-existing knowledge of the sources. 
Box 325 shows that a predetermined number of filters 

3 0 may be provided. 

[0039] At step 330, a post-filter signal dictionary 

is generated using the pre-filter signal dictionary and 
the directional filters. A post-filter signal 
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dictionary includes copies of each basis function as 
filtered by each filter (provided at step 320) . Each 
element of the post- filter signal dictionary is a 
filtered basis function, which is denoted by 
5 d' ij (t)= h^djit). Thus, each filtered basis function 

approximates how a particular basis function is 
received (by a sensor) if that basis function 
originates from a source at a particular location. 
Box 335 shows filtered basis functions that can be 
10 obtained by convolving the contents of boxes 315 and 
325 . 

[0040] The elements of the post-filter signal 

dictionary may represent filtered signals d'^it) forming 

part of the composite signal received by the sensor. 
15 Therefore, if the filtered signals are contained within 
the post-filter signal dictionary, this provides a 
known variable that can be used to separate the 
sources . 

[0041] FIG. 4 shows a flow chart illustrating the 

20 steps of separating sources in accordance with the 

principles of the invention. Beginning at step 410, 
the sensor receives a composite signal. As stated 
above in connection with equation 3, the composite 
signal is the sum of the filtered sources, where each 
25 filtered source is further characterized as having at 

least one filtered basis function (signal) and at least 
one coefficient corresponding to each filtered basis 
function (signal) . 

[0042] At step 420, the coefficient of each source 
30 is estimated using the composite signal and the post- 
filter signal dictionary that was generated through the 
application of directional filters. This step can be 
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performed by solving for the coefficients Cij in, for 
example, equation 3. The coefficient Ci-, is solvable 
because the composite signal is known and the filtered 
basis functions, which may be provided in the post- 
5 filter signal dictionary, are also known. Persons 
skilled in the art will appreciate that there are 
several different approaches for solving for each 
coefficient. For example, in one approach, a sparse 
solution of the coefficients may be solved. In another 
10 approach, a convex solution of the coefficients may be 
solved . 

[0043] To solve for the coefficients, the composite 

signal may be characterized as a mathematical equation 
using some form of the relationship y=Dc . This can be 

15 accomplished by separating y(t) into discrete time 

slices or samples tl, t2, ... tM. This is sometimes 
referred to as descretizing the signals. Once 
descretized, equation 3 can be rewritten in matrix 
form, as shown in equation 4 : 

2 0 y=Dc ( 4 ) 

where c is defined as single column vector containing 
all coefficients c ijf with the elements indexed by i and 
j , and D is a matrix whose k-th row holds the elements 
d' 4 Jt„) . The columns of D are indexed by and i and j , 

2 5 and the rows are indexed by k. Y is a column vector 

whose elements correspond to the discrete- time sampled 
elements y (t) . 

[0044] The coefficients can be obtained by solving 

for c in equation 4. The y variable is known because 
30 it is obtained from the received composite signal y(t) 
and the D variable is known because is provided by a 
signal dictionary (e.g., a post-signal dictionary from 
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step 330 of FIG. 3) generated through the application 
of directional filters. 

[0 045] An advantage of the invention is that many 
factors can be taken into account when solving for the 
5 coefficients while still accurately separating the 
sources. For example, one factor can include the 
knowledge or information (e.g., position of sources, 
the number of sources, the structure of the signals 
emitted by the sources, etc.) that is known about the 

10 sources. The knowledge of the sources may determine 
whether the source separation problem is tractable 
(e.g., solvable). For example, there may be instances 
in which there is considerable prior knowledge of the 
sources (in which case the source separation problem is 

15 relatively simple to solve) . In other instances, 

knowledge of the sources is relatively weak, which is 
typically the case when source separation is being used 
in practice (e.g., blind source separation). 
[0046] The techniques used to solve for c may vary 

20 depending on the post-filter signal dictionary. For 
example, if the signal dictionary forms a complete 
basis, c can be obtained from c = D~ x y- A signal 
dictionary that forms a complete basis may be provided 
when the prior knowledge of the sources is substantial 

25 (e.g., the position of each source is known) . In a 

complete basis, there is a one-to-one correspondence of 
filtered basis functions in the signal dictionary to 
filtered basis functions received in the composite 
signal . 

30 [0047] However, in the case where the post-filter 

signal dictionary forms an overcomplete basis, many 
different solutions for c may be obtained. This is 
sometimes the case when the knowledge of the sources is 



- 16 - 



relatively weak. The solutions may be obtained solving 
for c, for example, in the pseudo- inverse c = D*y. An 
overcomplete post-filter signal dictionary includes 
more filtered basis functions then necessary to solve 
5 for the coefficients. This excess results in a system 
that is underdetermined (i.e., there are many possible 
combinations of filtered basis functions that can be 
used to replicate sources in the composite signal 
y(t) .) 

10 [0048] In the undetermined case, it is desirable to 

select a solution with the highest log-probability 
corresponding to the sparsest solution. This can be 
accomplished by introducing a regulariser that 
introduces an assumption that the coefficients can be 

15 represented as a distribution (e.g., a Gaussian, 

Laplacian, or Bayesian distribution) . This assumption 
can be expressed as condition on the norm of the c 
vector (in equation 4) . The condition can require, for 
example, a c to be found that minimizes the L p norm ||c|| 

2 0 subject to Dc=y, where 

tt,-(LM'Y (5) 

[0049] Thus, different choices of p (e.g., a p of 0, 

1, or 2) correspond to different assumptions (e.g., 
distributions) and yield different solutions. For 
25 example, if p is 1, the following condition is solved 

minimize subject to y=DC (6) 

v 

It will be understood that the condition set forth in 
equation 11 can be determined using linear programming. 
Thus is seen that the regulariser provides the prior 

3 0 knowledge of the sources needed to solve for the 
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coefficients when no such prior information is actually- 
known . 

[0050] It is understood that the condition Dc=y can 

be relaxed. That is, the L p norm of c can be determined 
5 if Dc=y is approximately matched, as opposed to being 
exactly matched. Relaxing this constraint 
advantageously enhances the robustness of the source 
separation algorithm according to the invention, 
thereby enhancing it applicability to source separation 
10 problems. 

[0051] For example, relaxing the constraint provides 

source separation in the presence of noise. Noise may 
be attributed to the sensor, itself (e.g., caused by 
sensor design limitations) , or to ambient noise 
15 impinging on the sensor. Noise can be taken into 
account by modifying equation 6 to include a noise 
process to 

minimize \\c\l subject to\DC - y\\ p < J3 (7) 

where (5 is proportional to a noise level and p = 1, 2, 
20 or oo. 

[0052] Another technique to compensate for noise is 
to introduce a vector e of "error slop" variables in 
the optimization (of equation 6) . The magnitude of the 
"error slop" variables is controlled by an allowable 
25 parameter e. This error vector is then incorporated 

into a modified form of equation 6 such that objective 
is to either 

minimize \c\ x subject toy = Dc + e and [je^ < e ( 8 ) 

or 

3 0 minimi ze \\c\\ } subject toy = Dc + e and Hefl^ < e ( 9 ) 

or 

minimize \\c\\ x subject to y = Dc + e and \\e\\ 2 < <= (10) 
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all of which can be used to solve unique solutions of 
the unknown coefficients. 

[0053] When the coefficients are obtained, the 
sources may be reconstructed. Steps 430A and 430B show 
5 reconstruction of the sources in "sensor space" and in 
"source space," respectively. Either one or both 
reconstruction steps may be performed to reconstruct 
the source . 

[0054] "Sensor space" reconstruction of step 430A 

10 reconstructs filtered sources. Such reconstruction can 
be performed using the following equation: 

y i (t) = c iJ d iJ (t) (11) 

where y± (t) is the particular source being reconstructed 
in "sensor space," c±j represents the coefficients 
15 estimated for this source (in step 420) , and d^. 

represents the filtered basis functions of this source. 
[0055] "Source space" reconstruction of step 430B 

reconstructs sources as if each source had not been 
filtered, but as if the source was emitted 

20 substantially directly from the source. An advantage 

of source separation is that it "de-echoes" each of the 
reconstructed sources because there is no need to use 
the post-filter signal dictionary. "Source space" 
reconstruction reconstructs each source using the 

25 estimated coefficients (obtained from step 420) and the 
basis functions of the pre-filter signal dictionary. 
For example, a de -echoed source can be reconstructed 
using equation 2. 

[0056] FIG. 5 shows two graphs illustrating how the 

30 invention can separate sources in an acoustic 

environment. Graph 500 shows the results of source 
separation without the use of directional filters and 
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graph 550 shows the results of source separation with 
the use of directional filters. 

[0057] Graphs 500 and 550 both show sources 1, 2, 

and 3 on the x-axis and the amplitudes of notes played 
5 by each source on the y-axis. Both graphs also show 

the actual coefficients, a LI norm of the coefficients, 
and a L2 norm of the coefficients. The LI and L2 norms 
refer to the minimization condition, shown in 
equation 7, where LI (p=l) refers to a Laplacian 
10 assumption and L2 (p=2) refers to a Gaussian 
assumption . 

[0058] For purposes of illustration assume that each 

source can play notes drawn from a 12 -tone (Western) 
scale. Further assume that each source occupies an 

15 unknown location and simultaneously plays two notes. 

The actual values of these two notes are shown by the 
circles in graphs 500 and 550. Each note has a 
fundamental frequency F and has harmonics thereof nF (n 
being 2, 3, ... n) . The amplitude of the harmonics is 

20 defined by l/n. Thus, the basis functions included in 
the pre-filter signal dictionary may be defined by 

d x ^Y-sinflTrnFj) (12) 

where F i= 2 i/12 F Q is the fundamental frequency of the ith 
note, and F Q is the frequency of the lowest note. 

25 [0059] In graph 600, in which no directional 

filtering is used, both the LI and L2 norms were not 
able to accurately determine the coefficients. Because 
no directional filters were used, the solutions were 
obtained using the pseudo-inverse of the pre-filter 

30 signal dictionary. The L2 norm solution resulted in a 
Gaussian distribution of the coefficients, all of which 
are incorrect. The LI norm solution resulted in a 
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sparse solution for the non-zero coefficients, but the 
absence of the post-filter signal dictionary prevented 
the solution from being able to correctly identify all 
of the coefficients . 
5 [0060] Graph 550 shows that the use of directional 

filtering enhances source separation. In this case the 
LI and L2 norms operated in connection with a post- 
filter signal dictionary. Graph 550 shows that the LI 
norm is able to accurately separate the sources, while 
10 the L2 norm solution remained poor. The difference in 
the performance of the norms shows that a sparseness 
assumption, expressed as a distribution over the 
sources, enable source separation to be performed 
accurately . 

15 [0061] FIG. 6 shows an illustrative system 600 that 

utilizes the source separating algorithm in accordance 
with the principles of the invention. System 600 may 
include sensor 610, processor 620, storage device 630, 
and utilization circuitry 640. Processor 620 may 

2 0 communicate with sensor 610, storage device 63 0 and 

utilization circuitry 640 via communications bus 660. 
[0062] It will be understood that the arrangement 

shown in FIG. 6 is merely illustrative and that 
additional system components may be added or existing 

2 5 components may be removed or integrated. For example, 
processor 620 and storage device 630 may be integrated 
into a single unit capable of providing both processing 
and data storage functionality. If desired, system 600 
may optionally include additional sensors 650. 

30 [0063] Sensor 610 and optional sensors 650 provide 

data (e.g., received auditory signals) to processor 620 
via communications bus 660. The type of sensors used 
in system 600 may depend on the signals being received. 
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For example, if acoustic signals are being monitored, a 
microphone type sensor may be used. Specific examples 
of such microphones may used in hearing aids or cell 
phones . 

5 [0064] Processor 620 receives the data and applies a 

source separation algorithm in accordance with the 
invention to separate the sources. Processor 620 may, 
for example, be a computer processor, a dedicated 
processor, a digital signal processor, or the like. 

10 Processor 620 may perform the mathematical computations 
needed to execute source separation. Thus, the 
processor solves for the unknown coefficients using the 
data received by sensor 610. In addition, 
processor 620 may, for example, access information 

15 (e.g., a post-filter signal dictionary) stored at 
storage device 63 0 when solving for the unknown 
coefficients . 

[0065] Storage device 630 may include hardware such 

as memory, a hard drive, or other storage medium 
20 capable of storing, for example, pre- and post-filter 
signal dictionaries, directional filters, algorithm 
instructions , etc . 

[0066] The data stored in storage device 630 may be 

updated. The data may be updated at regular intervals 

25 (e.g., by downloading the data via the internet) or at 
the request of the user (in which case the user may 
manually interface system 600 to another system to 
acquire the updated data) . During an update, improved 
pre-filter signal dictionaries, directional filters, or 

30 post-filter signal dictionaries may be provided. 

[0067] Storage device 630 may have stored therein 

several pre-filter dictionaries and directional 
filters. This may provide flexibility in generating 
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post-filter signal dictionaries that are specifically 
geared towards the environment in which system 600 is 
used. For example, system 600 may analyze the 
composite signal and construct a post -filter signal 
5 dictionary based on that analysis. This type of "on- 
the-fly" analysis can enable system 600 to modify the 
post-filter signal dictionary to account for changing 
conditions. For example, if the analysis indicates a 
change in environment (e.g., an indoor to outdoor 

10 change) , system 600 may generate a post-filter signal 
dictionary according to the changes detected in the 
composite signal. Hence, system 600 may be programmed 
to use a pre-filter signal dictionary and directional 
filters best suited for a particular application. 

15 [0068] Utilization circuitry 640 may apply the 

results of source separation to a particular use. For 
example, in the case of hearing aid, utilization 
circuitry 640 may be an amplifier that transmits the 
separated sources to the user's ear. If desired, 

20 system 600 may reconstruct a portion (e.g., desired 
sources) of the sources forming the composite signal 
for transmission to utilization circuitry 640. 
[0069] Thus it is seen that multiple sources can be 

separated and reconstructed using directional dependant 

25 filtering. Those skilled in the art will appreciate 
that the invention can be practiced by other than the 
described embodiments, which are presented for purposes 
of illustration rather than of limitation, and the 
invention is limited only by the claims which follow. 



